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Abstract 

Given  a planar  straight  line  graph  G with  n vertices  and  a point  PQ, 
locating  PQ  means  to  find  the  region  of  the  planar  subdivision  induced 
by  G which  contains  PQ.  Recently,  Lipton  and  Tarjan  presented  a brilliant 
but  extremely  complex  point  location  algorithm  which  runs  in  time  O(logn) 
on  a data  structure  using  0(n)  storage.  This  paper  presents  a practical 
algorithm  which  runs  in  less  than  6 T log^nl  comparisons  on  a data  structure 
which  uses  O(nlogn)  storage,  in  the  worst  case.  The  method  rests  crucially 
on  a simple  partition  of  each  edge  of  G into  O(logn)  segments. 
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1.  Introduction 


The  problem  of  locating  a point  in  a planar  subdivision  - briefly 

called  "point  location"  - is  quite  important  in  computational  geometry  and 

has  received  considerable  attention  in  the  recent  past.  It  is  stated  as 

follows!  Given  a oonnected  planar  straight-line  graph  G on  n vertices  and  a 

point  Pq,  find  which  region  of  the  planar  subdivision  Induced  by  G contains  Pq. 

An  early  solution  to  this  problem  was  proposed  by  Dobkln  and  Llpton  [l], 

whose  location  algorithm  runs  in  time  O(logn)  on  a data  structure  which  uses 
2 2 

0(n  ) space  and  can  be  built  in  0(n  ) time.  More  recently  Lee  and 

2 ( 1 ) 

Preparata  [2]  [3]  developed  an  O(log  n)^  ' time  location  algorithm  on  a 
data  structure  constructed  in  O(nlogn)  time  and  using  0(n)  space.  Observing 
the  trade-off  between  space/^reprocessing  on  one  side  and  search  time  on  the 
other,  Shamos  [4]  raised  the  question  of  whether  O(logn)  search  time  was 
achievable  with  less  than  quadratic  storage.  This  issue  was  definitively 
settled  by  Lipton  and  Tarjan  [5  ] who  showed  that  the  point  location  problem  - 
called  by  them  "triangle  problem"  - could  be  solved  in  O(logn)  time  on  a 
data  structure  which  uses  0(n)  space  and  can  be  constructed  in  time  O(nlogn). 
Their  brilliant  method,  which  is  based  on  a theoretically  far-reaching 
planar  separator  theorem  [6],  is,  however,  algorithmically  extremely 
complicated;  to  quote  Lipton  and  Tarjan  themselves,  "...  this  algorithm 
[is  not  advocated]  as  a practical  one,  but  its  existence  suggests  that  there 
may  be  a practical  algorithm  with  O(logn)  time  bound  and  0(n)  space  bound". 

^^All  logarithms  in  this  paper  are  to  the  base  2. 
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The  result  presented  in  this  paper  comes  very  close  to  providing  a 

complete  substantiation  of  the  above  conjecture;  specifically,  we  shall  exhibit 

a practical  point  location  algorithm  which  runs  in  O(logn)  time  on  a data 

structure,  which  can  be  constructed  in  O(nlogn)  time,  but  which  uses 

O(nlogn)  space  rather  than  just  0(n). 

Our  method  could  be  viewed  as  an  evolution  of  the  original  technique  of 

Dobkin  and  Llpton  [l],  which  we  now  briefly  review.  A horizontal  line  is 

drawn  through  each  vertex  of  G,  thereby  slicing  the  plane  into  horizontal 

strips  called  "slabs";  each  slab  contains  no  vertex  of  G and  is  subdivided  by 

the  transversal  edges  into  an  ordered  set  of  0(n)  regions.  Point  location  is 

accomplished  by  first  searching  the  horizontal  lines  to  locate  a slab  and  by 

next  searching  the  segments  crossing  the  slab  to  locate  a region.  Clearly 

this  search  is  carried  out  in  O(logn)  comparisons,  but  since  an  edge  is 

2 

partitioned  by  0(n)  horizontal  lines,  0(n  ) storage  is  used.  In  contrast,  our 
method  interleaves  tests  against  horizontal  lines  and  test  against  edges;  thus 


it  will  not  be  necessary  to  decompose  the  edges  in  0(n)  portions.  In  particular, 
the  method  rests  crucially  on  the  observation  that  each  edge  of  G can  be 
decomposed  uniquely  into  O(logn)  segments. 
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2.  Logarithmic  segmentation  of  edges 

Let  a point  v in  the  plane  (x,y)  be  given  as  a pair  of  coordinates  x(v)  and 
y(v)  and  let  [vQ, . . . ,v  } b«  the  vertex  set  of  G,  where  the  numbering  is 
such  that  y(Vjj)  ^ y(v^)  < ...  < y(vR_^).  (In  the  sequel  we  shall  assume  for 
simplicity  that  these  ordinates  are  distinct;  the  details  of  the  general 
case  are  straightforward.)  For  additional  simplification  and  without  loss  of 
generality  we  may  asstime  that  y(v^)  * i;  so,  when  we  say  that  the  ordinate  of 
a point  u is  i we  mean  y(u)  - y(v^). 

Each  edge  is  to  be  partitioned  into  a collection  of  segments;  each  of 
these  segments  will  be  simply  denoted  by  the  ordered  pair  of  ordinates  of  its 
extremes.  The  set  of  pairs  of  ordinates  delimiting  segments  is 

k k r 

S ■ {(2  j ,2  (j+l))|j,k  nonnegative  integers}.  We  want  to  partition  each  edge 
into  a minimal  number  of  such  segments:  for  example,  edge  (9,21)  will  be 
partitioned  into  (9,10)(10,12)(12,16)(16,20)(20,21). 

For  any  given  pair  of  nonnegative  Integers  m and  r we  define  the  set 
S(m,r)  - a subset  of  S - as  follows: 


S(m,r)  - £(2kj,2k(j+l))|2rm  < 2kj ,2*( j+1)  < 2r(m+l)}.  The  elements  of 
S(m,r)  are  organized  as  the  nodes  of  a full  binary  tree  D(m,r)  as  shown  in 
figure  la  (a  similar  structure  has  been  called  range  tree  by  Bentley  [11]). 
In  figure  lb  we  show  the  complete  tree  D(0,3). 


2rm,2r(m+l) 


D(2m,r-1)  D(2m+l,r-l)  (C 


7}  7,8 


Figure  1.  Definition  of  tree  D(m,r)  and  illustration  of  D(0,3) 
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Given  an  edge  (vi,vJ),  with  0<  i < J < n-1,  we  can  now  produce  the  desired 
logarithmic  segmentation  of  it  with  the  aid  of  the  tree  0(0,  !"iog(n-l)l  ), 
simply  referred  to  as  D.  This  is  accomplished  by  the  following  algorithm 
SEGM((i, j),D)  (here  Xq.X^,  and  ^2  are  *iat9»  and  denote8  li8t  concatenation). 
The  segmentation  is  performed  on  the  vertical  projection  (i,j)  of  (v^v^). 

SEGM((i,j),D) 


begin 


V ♦-  R00T(D) 

(p»q)  «-  (l(V),r(v)). 

l£  (ifj)  ■ (p.q)  Chen-1  *-  (P.q) 

else  If  j < (p+q)/2  then  X «-  SEGM((i,J),  LEFTSUBTREE (V ) ) 

else'  If  i 2 (p+q)/2  then  £ «-  SEGM( ( i , j ) .RIGHTSUBTREE (V ) ) 


else  begin  Zl  «-  SEGM((i, (p+q)/2) , LEFTSUBTREE (V ) ) 
X,  ♦-  SEGM(((p+q)/2,J),  RIGHTSUBTREE(V) ) 


X «-  Xj^  o X2 


end 


end 


return  X 


For  example,  SEGM((1,7),  D ) produces  X - (1,2) , (2,4) , (4,6) , (6,7) . The 
action  of  SEGM  can  be  viewed  as  tracing  two  paths  - possibly  with  a common 
initial  subpath  - from  the  root  of  D to  two  of  its  leaves.  The  number  of 
recursive  calls  is  therefore  at  most  twice  the  depth  of  D,  and  since  each 
new  call  takes  time  bounded  by  a constant,  SEGM  runs  in  time  O(logn). 


[j 
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n 


We  now  state  without  proof  properties  which  follow  directly  from  the 
algorithm  SEGM: 

Proposition  1. 

An  edge  (i,j)  with  0^  i < j < n-1  is  partitioned  by  SEGM  in  at  most 
2l"iog(n-l)n  -2  segments; 

Proposition  2. 

Let  (h,k)  be  a segment,  with  h < k (h  and  k are  the  ordinates  of  the 
two  extremes  of  the  segment).  If  h ■ 2r«h*  with  h1  odd,  then 
k€  [2r*h'  + 2*1  i-0,l,...,r.}. 

3.  Construction  of  the  point  location  tree 

We  shall  now  construct  the  data  structure  J - a binary  search  tree  - 
to  be  used  by  the  point  location  algorithm. 

Without  loss  of  generality  we  may  assume  that  the  given  planar  straight-line 
graph  G with  n vertices  be  a triangulation;  if  not,  G can  be  transformed  into  one 
in  time  O(nlogn)  by  adding  edges , according  to  the  algorithm  of  Garey  et  al.  [7]. 

The  graph  G is  also  assumed  to  be  given  as  a collection  of  ordered  edge 
lists;  specifically,  we  let  6^  - {(j,i)|(j,i)  is  an  edge  of  G and  i > j]  and 
we  assume  that  the  members  of  6 ^ are  ordered  clockwise  around  v^. 

This  representation  is  obtainable  in  time  O(nlogn)  from  the  more 
conventional  representation  [Edges  incident  on  v ^ | j - 0,...,n-l}. 
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A preliminary  task  is  the  logarithmic  segmentation  of  the  edges  of  G. 

Let  G have  m edges.  Each  edge  e of  G is  partitioned  into  a string  of  segments 
by  means  of  the  algorithm  SEGM  oucj-ined  in  the  previous  section.  With  each 
segment  t we  associate  an  integer,  height(t),  which  is  the  ordinate  of  its 
upper  extreme.  The  string  of  segments  of  e is  stored  as  an  ordered  list 
(tl*t2» • • • »tr) » where  the  order  is  such  that  height(t^)  < height(t2>  < ... 

< height(tr):  t^  and  t^  are  respectively  the  initial  and  terminal  segments 
in  the  list.  Since  each  edge  can  be  partitioned  in  time  O(logn)  and 
m<  3n-6,  the  entire  edge  segmentation  task  runs  in  time  O(nlogn)  and  uses 
O(nlogn)  space. 

The  procedure  which  constructs  the  data  structure  J is  called  ORGANIZE 


and  has  access  both  to  the  set  of  lists  ■ 0,...,n-2}  and  to  the  m ordered 


lists  of  segments.  Specifically,  it  starts  with  the  initial  segments  of  the 
edges  issuing  from  v^,  processes  them  and  proceeds  by  acquiring  the  "upward 
continuation(s)"  of  each  of  the  processed  segments.  This  is  easily  done  as 
follows,  where  we  assume  that  segment  t is  contained  in  some  list  o.  We  also 
denote  by  L(t)  a list  of  segments  which  are  the  upward  continuations  of  t*,  L(t) 
is  referred  to  by  a pointer  b(t)  associated  with  t.: 


If.  t is  terminal  in  a (Comment:  t reaches  vertex  vjjeight(t)  t^ien 


kggiP  e *~eheight(t) 


^height  (t)*"  A(2) 


If  £ + A then  L(t)  *-  string  of  initial  segments  of  edges  in  £ 
else  L(t)  •-  A 


end 


else  L(t)  «-  successor  of  t in  a 


(2) 


Here  and  hereafter,  A denotes  the  empty  sequence. 


mjL 


T 
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The  procedure  also  makes  use  of  two  auxiliary  functions,  JOIN  and  BALANCE, 
to  be  discussed  in  detail  later:  presently,  suffice  it  to  say  that  JOIN 
joins  together  two  binary  trees  by  providing  a common  root,  while 
BAIANCE  structves  a forest  of  binary  trees  into  a single  binary  tree. 

We  shall  now  informally  describe,  and  illustrate  with  an  example,  the 
procedure  ORGANIZE  (£,h,k),  where  h and  k are  integers  (h  < k)  and: 

(i)  £ is  a string  of  segments,  which  have  the  properties  that  the 

ordinates  of  their  lower  extremes  are  identical  and  equal  to  h, 
while  the  ordinates  of  their  upper  extremes  are  no  greater  than  k 
(descriptively  we  say  that  the  segments  in  £ are  contained  in  the 
horizontal  slab  [h,k]). 

(ii)  Either  k - n-1  or,  letting  h = h'  • 2r  (odd  h'),  k = h + 2^ 
for  some  0 < 1 < r. 

Notationally,  for  some  terms  a^, . . . ,a^,  (a^, . . . ,a^)  denotes  a string, 
while  (a^,...,ar)  denotes  a binary  tree  so  that  the  string  (a^...,ar)  is  obtained 
when  the  tree  is  traversed  in  inorder  ([8],p.  ).  If  A^  and  are  two  strings, 

I T • 

(A^,A2)  is  their  concatenation.  The  procedure  also  makes  convenient  use  of 
stacks  £,S,U,  B ; following  [8  ],  for  a stack  S,  "x  e S"  denotes  that  x 
is  the  element  which  has  been  "popped",  while  "S  <=•  x"  denotes  that  x has 
been  "pushed"  into  the  stack.  When  a string  is  stored  in  a stack,  its  left- 
most term  is  at  the  top  of  the  stack. 
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ORGANIZE  (X,h,k)(see  Comment  1.  below) 


1.  begin  S - A,  J - k,  jQ  - 0 

2.  If  h k then  (see  Comment  2) 

3.  While  jQ  < k do  (see  Comment  2) 

4.  begin  While  X * A and  height (TOP(X ) ) < j do 

5.  begin  x * X 


6. 


7. 


8. 


9. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


j - height(x) , 

Form  L(x)  and  b(x)  (see  Comment  3) 

S « (x,b(x)) 
end  (see  Comment  4) 

While  height(TOP(S) ) - j do 
begin  (x,b(x))  «*  S 
U x 
£«  L(x) 

end  (see  Comment  5) 

«-  BAIANCE(U)  (see  Comment  6) 

^jb^))  - ORGANIZE  (0,  j ,min(2j -h, k) ) (see  Comment  ) 
_If  t A then  a *-  JOIN  (a^,a2),b(a)  — b(a2> 
else  a «-  a^,  L(a)  •-  6 (see  Comment  8) 

S « (a,b(a)) 

j0"  j 

j •-  min  (2j-h,k)  (see  Comment  9) 

end 

return  S 


end 
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Comment  1,  [Referring  to  the  graph  in  Figure  2,  we  consider  ORGANIZED, 4, 8); 
segments  are  indicated  by  means  of  integers  and  X ■ (15,16, 
17,18,19,20,21,22,35);  TOP(X)  is  segment  15.] 

Comment  2.  The  major  controls  of  the  algorithm  are  embodied  by  Steps  2 

and  3.  Obviously,  if  h « k,  the  horizontal  slab  is  empty  and  the  empty 
tree  is  returned  (Steps  2 and  19).  Moreover  - as  we  shall  see  (Comments 
8 and  9)  - processing  is  completed  when  the  control  variable  becomes 
equal  to  k. 

Comment  3.  The  string  L(x)  of  the  upward  continuations  of  segment  x is 
constructed  and  referred  to  via  b(x),  as  previously  outlined. 

Comment  4.  Loop  4-8  finds  the  longest  prefix  X*  of  string  X so  that  the 
terms  of  (S,X*)  have  nonincreasing  heights;  X*  is  removed  from  X and 
concatenated  with  S.  Specifically  two-field  records  (x,b(x))  are  entered 
into  S.  [In  our  example,  S becomes  (19,18,17,16,15).] 

Comment  5.  Loop  9-12  finds  the  longest  suffix  of  S of  elements  with  constant 
heights,  removes  it  from  S and  places  it  into  a stack  U.  Also  for  each 
x transferred  from  S to  U the  list  L(x)  (pointed  to  by  b(x))  of  the 
upward  continuations  of  x is  p’aced  into  a stack  B.  [in  our  example, 
at  this  point  we  have  S * (16,15),  U = (17,18,19)  and  B ■ (26,25,24).] 
Comment  6.  The  function  BAIANCE  - to  be  described  in  Section  4 - structures 
the  terms  of  U into  a binary  tree  to  be  denoted  as  HT^(U).  Each  of 
these  terms  is  stored  as  a node  in  the  search  data  structure  7\  a^ 
refers  to  the  node  storing  the  root  of  HT^(U),  and  is  itself  treated 


as  a "term" 
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Comment  7.  This  recursive  call  obtains  a tree  M^U),  again  referred  to 
through  its  root  in  a.^',  b(a^)  points  to  the  string  of  segments  which 
are  the  upward  continuations  of  ffl^U).  [in  our  example, 
in^CU)  * (24,25,26)  and  b^)  points  to  the  string  (28,29,30).] 

Comment  8.  If  both  HT^(U)  and  ffl^U)  are  nonempty,  they  are  joined  together 
into  a new  binary  tree  VT(U)  - referred  to  via  its  root  a - and  the 
upward  continuations  of  ffl^U)  become  the  upward  continuations  of  VT(U) 
[in  our  example,  VT(U)  * ((17,18,19)  * (24,25,26))];  otherwise,  when 
is  empty,  the  string  3 itself  gives  the  upward  continuations. 
Notice  that  is  empty  only  when  in  Step  14  we  have  a call 

ORGANIZERS, k, k) , i.e.,  when  j * k. 

Comment  9.  A new  "term"  (a,b(a))  is  formed  in  Step  16  and  pushed  into  S 

[in  our  example,  S becomes  (((17,18,19)  * ( 24,25,26)) , 16 , 15)] . Notice 
that  the  major  loop  4-18  is  repeated  until  j^  is  set  equal  to  k in 
Step  17,  which  occurs  exactly  just  after  Step  14  returns  the  empty  tree. 
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In  our  example,  ORGANIZE  ((15, 16,17,18, 19, 20, 21, 22,35), 4,8)  produces  the  tree 
a - ( 15((  16, (<  17, 18, 19) *(  24,25,26) ) ,20)*<«27,28)*( 32,33,34))  ,29, 30»  ,21,22,35); 
b(a)  points  to  the  string  (36,37,38,39,40,41). 

height 


0 

Figure  2.  Illustration  for  the  procedure  ORGANIZE 
V? 

Once  the  function  ORGANIZE  Is  available, denoting  by  the  string  of  the 
initial  segments  of  the  edge  string  the  construction  of  the  search 
structure  J for  graph  G is  trivially  done  by  the  single  call 
ORGANIZE (£Q,0,n-l).  In  our  example,  JCQ  - (1,2,3). 

The  construction  of  subtrees  occurs  in  Steps  14,  15,  and  16.  As  we  noted 
there  are  two  types  of  subtrees  - H-trees  and  V-trees  -,  depending  upon  the  way 


they  are  generated.  The  root  of  a V-tree  is  said  to  be  a V-node,  while  all 
others  are  referred  to  as  H-nodes. 
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A subtree  is  said  to  contain  a segment  t If  t has  been  assigned  to  one 
of  its  nodes;  a subtree  Is  said  to  contain  a vertex  v of  G If  v is  in  the 
interior  of  the  trapezoid  which  is  the  convex  span  of  the  points  of  the 
segments  contained  in  the  subtree;  for  each  subtree  we  define  its  slab  as 
the  smallest  horizontal  planar  strip  where  all  of  the  tree's  segments  lie. 

We  begin  by  discussing  V-trees.  Subtree  VT(U)  is  obtained  by  "joining" 
together  two  H-trees,  HT^(U)  and  HT2(U).  Notice  that  slab(HT^(U))  - [h,j] 
and  slab(HT2(U))  ■ [j,mln(2j-h,k)]  are  adjacent;  HT^U)  and  HT2(U)  are  joined 
by  means  of  a V-node,  which  is  assigned,  as  a discriminant,  the  ordinate  of 
the  horizontal  line  separating  the  two  adjacent  slabs  (for  example,  in 
Figure  2,  (27,28)  and  (32,33,34)  are  separated  by  the  horizontal  line  y - 7). 
Notice  that  in  executing  0RGANIZE(£Q,0,n-l),  the  condition  k < 2j-h  in  a 
recursive  call  (Step  15)  may  occur  only  for  k ■ n-1;  since  j-h  ■ (2j-h)-j, 
for  the  two  adjacent  slabs  we  have: 

Proposition  3.  width(slab(HT1(U)))  * width(slab(HT2(U))) . If  the  Inequality 
is  strict,  then  slab(HT2(U))  is  upper  bounded  by  the  line  y ■ n-1. 

We  define  the  level  of  VT(U)  as  log  max  (width(slab  HT. (U)))  + 1.  Since 

1*1,2  1 

a slab  of  width  w (an  integer)  contains  exactly  w-1  vertices  of  G in  its 
interior,  we  obviously  have  that  a V-tree  of  level  1 contains  at  most  2^-1 
vertices  of  G.  On  the  other  hand  any  V-tree  - except  possibly  one  whose  slab 
is  upper  bounded  by  y ■ n-1  - contains  at  least  one  vertex  of  G.  Thus  we  have: 
Proposition  4.  All  V-trees  - except  possibly  one  - contain  at  least  one 
vertex  of  G;  if  level  (T)  ■ i,  then  T contains  at  most  2*-l  vertices  of  G. 


The  number  of  V-nodes  is  obtained  as  follows.  Let  P*[t|t  is  a V-tree  and  there 
is  no  other  V-tree  T'  which  is  a proper  subtree  of  t).  The  root  of  any  T 6 P is 
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Ch«  only  V-nod*  in  T,  otherwise  there  would  be  7- tree  T'  which  is  a subtree  of  T. 

The  cardinality  of  P is  at  most  (n-2),  since  vA  and  v are  not  contained  in 

u n 

any  V-tree.  For  each  T € P,  suppose  to  trace  the  path  from  its  root  V to  the 

root  of  J and  let  V-,V,,...,V  be  the  sequence  of  the  V-nodes  encountered; 

* J P 

Vj  is  the  root  of  some  V-tree  Tj,  and  obviously  level  (T)  < level  (T^)  < ... 

< level  (Tp).  Since  the  level  of  any  V-tre*  is  upper-bounded  by  Tloghl  , and 
|P|  S n-2,  we  conclude  that  the  number  'ii  V-nodes  is  upper-bounded  by 
O(nlogn). 

We  now  consider  the  other  type  of  subtrees,  the  H-trees.  They  are  formed 
by  structuring  (Step  14)  into  a binary  tree  a mixed  sequence  U of  segments  and 
V- trees , all  spanning  the  same  horizontal  slab  (e.g.,  16,  <(17,18,19)*<24,25,2o  , 

and  20  in  Figure  2).  In  general  U is  the  form  * *Tr-l^rTr’  w*lere 

Tj's  are  V-trees  of  Identical  level  and  TQ,...,Tr  are  each  a string  of 
segments;  we  claim  that  none  of  the  strings  T1,...,Tr_1  Is  empty.  To 
prove  this,  notice  that  each  T^  is  the  join  of  two  H-trees  and  Hjj. 
if  rl  is  empty,  for  IS  IS  r-1,  the  procedure  ORGANIZE  would  combine 

the  member a of  and  into  a single  tree,  before  examining  the 

members  of  and  Hj  i+1,  thus  contradicting  the  existence  of  T^  and  T1+1. 

The  nodes  created  in  structuring  U are  H-nodes  and  to  each  one  of  them  we 
assign  one  of  the  segments  in  tq  U Tj^  U ...  U Tr,  and  a (discriminant) 
linear  function  f(x,y),  so  that  f(x,y)  - 0 is  the  equation  of  the  line 
containing  that  segment.  The  details  of  the  construction  of  HT^U)  and 
HTj^)  - by  the  subroutine  BALANCE  - will  be  discussed  in  the  next  section 
in  connection  with  the  performance  analysis  of  the  method;  presently,  we  just 
note  that  the  number  of  V-nodes  (i.e.  of  V-trees)  involved  in  the  structuring 
process  is  Just  0(n),  rather  than  O(nlogn). 
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To  see  this,  we  reduce  the  tree  J to  a tree  J, v which  contains  only  V-nodes 
and  1b  constructed  as  follows:  delete  and  bypass  all  the  H-nodes  of  J one  at  a 
tine,  l.e.,  for  each  non-leaf  H-node  V replace  the  three  arcs  (FATHER(V),V), 
(V,LEFTSON(V)),  and  (V,RIGHTSGN(V))  with  the  two  arcs  (FATHER (V ) ,LEFTSON(V) ) and 
(FATHER (V ) , RIGHTSON (V ) ) ; a leaf  H-node  Is  just  suppressed.  Clearly  J has  at  most 
(n-2)  leaves.  The  nodes  of  are  of  three  types:  the  regular  ones  with 
two  or  more  "children",  the  singular  ones  with  exactly  one  child, and  the 
leaves ; It  is  clear  that  only  the  children  of  regular  nodes  take  part  in  the 
balancing  process.  Therefore  suppose  now  to  further  delete  and  bypass  every 
singular  node;  the  resulting  tree  Is  such  that  its  non-leaf  nodes  have  at 
least  two  children  and,  since  there  are  at  most  (n-2)  leaves,  there  are  at 
most  2n-5  nodes  altogether.  This  proves  the  claim. 

If  we  represent  H-nodes  by  the  symbol  (t)  , where  t is  the  number  of 
the  segment  assigned  to  the  node,  and  V-nodes  by  the  symbol  \y’  , where  y is 
the  ordinate  assigned  to  the  node,  the  structure  J for  the  graph  of  Figure  2 
is  shown  in  Figure  3. 


Figure  3.  The  binary  search  tree  J for  the  graph  of  Figure  2 
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4.  Performance  analysis  of  the  method 

We  begin  by  evaluating  the  deptji  of  the  search  tree  J.  Clearly  J has 

O(nlogn)  nodes.  In  fact  the  H-nodes  are  in  a one-to-one  correspondence  with 

the  set  of  segments,  and  the  latter  has  cardinality  O(nlogn);  as  to  the 

V-nodes,  we  have  Just  shown  that  their  number  is  also  O(nlogn). 

If  J were  balanced,  it  would  have  depth  O(logn).  However  there  is  no 

explicit  provision  in  the  ORGANIZE  algorithm  to  achieve  such  property;  as  a 

matter  of  fact,  the  depth  of  J critically  depends  on  the  subroutine  BALANCE 

used  for  structuring  H-trees.  Indeed,  suppose  that  in  Step  14  of  ORGANIZE,  the 

set  U contains  0(n)  V-trees.  The  increase  in  depth  produced  by  BALANCE (U) 

2 

could  be  O(logn),  thereby  resulting  in  an  O(log  n)  depth  for  J.  However, 
we  shall  now  describe  a procedure  BALANCE  which  produces  a global  O(logn) 
depth  for  J.  The  procedure  is  based  on  the  following  lemmas  (the  first  of  whir'-' 
is  a variant  of  another  lemma  presented  in  [9  ]): 

Lemma  1.  Let  <7  ■ a^j.^a  be  a string  with  p > 1 and  let  the  positive 

p 

integer  |a.|  denote  the  weight  of  a ; also,  let  |<7|  - |a  | and  M - max  |a.|. 

J j1  J j«l  J 

Then  for  any  number  M £ m < |tf| , the  string  <7  can  be  algorithmically 

partitioned  as  <7  so  that  \d2\  < m,  |tf|3  < m,  and  \<72\  + \<7j\  > m. 

Proof;  Arrange  the  terms  of  3 as  the  leaves  of  a balanced  binary  tree 

t(0)  and  for  each  node  V of  this  tree  t(<7)  compute  the  weight  |v|  as 

|LEFTSON(V)[  + |riGHTSON(V)  | ; obviously  |R00T(t(<7»  | - \d\ . If  we  trace  a 

path  from  the  root  of  t(<7)  following  at  each  node  the  branch  of  larger  weight, 

the  weights  of  the  traversed  nodes  form  a decreasing  sequence  whose  minimum 

is  guaranteed  to  be  no  larger  than  M.  Thus  there  is  a unique  node  V*  on  this 

path  such  that  |v*|  > m,  |LEFTSON(V*) | < m, |RIGHTSON(V*) | < m.  We  then  let 

&2:m  9tring  of  leaves  of  LEFTSCN(V*),  3^'m  string  of  leaves  of  RIGHTSON(V*) , 

while  (7^  and  <7^  are  the  (possibly  empty)  prefix  and  suffix  of  3.  □ 
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For  a V-tree  T we  define  its  weight  1 1|  as  the  number  of  vertices  of  G 

contained  in  T (recall  that,  by  Proposition  4,  (level (T)  - 1)  » (|t|  * 1)). 

An  X_-_gtring  has  the  form  U - toTit1*  * ,Tr-lTrTr’  where  the  Tj's  are  V-trees  of 

identical  level  l > 1 and  the  t^'s  are  (possibly  empty)  strings  of  segments. 

r 


We  define  the  weight  | u|  of  U as  E |t.|. 

j-1  j 

LemfLi.  Let  VT  - JODKHT^H^).  The  trees  HT1  and  HT^  can  be  algo- 
rithmically constructed  so  that  depth(VT)  < Tlognl  + 21og|VT|  + 3level(VT)  - 1. 

Proof:  For  simplicity,  let  6 (T)  - depth(T)  - Tlognl  . We  make  the 
following  inductive  hypotheses: 

PI.  If  U is  a j-string  with  0 < | u|  < Kj  < K,  then  6 (U)  < 21og|u|  +3j  + l; 
P2.  If  T is  a V-tree,  with  | t|  < K and  level (T)  - j < £,  then 
6(T)  < 21og|  t|  + 3j  - 1. 


The  induction  can  be  started  with  j-1.  In  fact  level(T)  - 1 implies 
|t|  - 1,  i.e.,  if  T - JOINO^.I^),  Hx  and  «2  are  each  trees  of 
0(n  ) segments,  so  that  6 (tt±)  < 0 (i  - 1,2)  and  6 (T)  < 1 < 2.  Also,  if  U 
is  a 1-string,  its  corresponding  slab  has  width  2.  It  follows  that  | u|  < 1 
and  either  U - t'Tt"  or  U - t,  where  are  strings  of  segments  and 

level (T)  - 1;  in  either  case  6 (U)  < 4. 

Proof  of  PI.  Let  U - T0T1T1* * ‘VlVr  and  I ul  " V Notice  that 
depth  (T^)  < Tlognl  (i.e.  6(t^)  < 0,  for  every  0 < i < r)  and  let 

IT 

| T | - max  | T | . 
s'  j.j.  ' j' 

(i)  |Ts|  > K^/2.  We  express  U as  where  both  and  U2  are 

j-strings  (with  (Uj  , |u2|  < 1^/2)  and  tj  and  t2  are  segments.  Since 
I TJ  1 < K and  level(Ts)  » j,  by  P2  we  have  6(Tg)  < 21og|Ts|  + 3j  - 1 

< 2 log  Kx  + 3j  - 1.  With  regard  to  1^(1 -1,2),  either  | uj  - 0 (in  which  case 


k 
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Ut  consists  of  segments  and  6(1^)  < 0)  or  by  PI  6(1^)  < 21og|ut|  + 3J  + 1 
< 21og  K^  + 3j  - 1.  Clearly  the  tree  in  Figure  4a  structures  U so  that 
6(U)  < 2 log  K.  + 3 J + 1 . 


(a)  (b) 

Figure  4. 


(2)  | Tg|  < Kj/2.  We  apply  Lemma  1 to  the  string  U with  m-K^/2.  We 
obtain  the  decomposition  u1t1u2t2U3t3U4»  where  ti»t2»t3  are  segments  and 
the  Uj's  are  J-strings  with  |U2|  ,|u3|  < K^/2,  and  | uj  + | U3|  > K^/2. 

The  latter  implies  |uj  + | uj  < 1^/2,  i.e.,  lujjuj  < 1^/2.  By  PI 
6(V1),6(U2),6(V3),6(U^)  < 21og(K1/2)  + 3j  + 1 - 2^1^  + 3J-1.  Then  clearly, 
the  tree  in  Figure  4(b)  structures  U so  that  6(U)  < 21ogK1  + 3j  + 1. 

Proof  of  P2.  Let  |VT|  - K and  level(VT)  - £,  with  VT  - J0IN(HT1,HT2) , 
|aril  - Kl,  I HT2|  - 1*2 , (Kj^  + Kj  - K). 

We  must  now  distinguish  two  cases : 

(1)  the  root  of  VT  is  a regular  V-node.  In  this  case,  0 < < K. 

Consider  HT^  (an  analogous  argument  holds  for  HT2).  The  set  U is  an 


1 


(l-l)-string  TqTjT^. . ,Tr_iTrTr * with  I ul  - < K and  level(U)  - 1-1.  Then, 

by  PI,  we  have  6 (U)  < 21og|u|  + 3(1-1)  - 1 = llogl^  + 31  - 2.  It  follows 
that  6 (VT)  - maxtf^MOn^))  +1  < 2max(logK1> log^)  + 31  - 2 + 1 < 21ogK+3!-l. 

(2)  The  root  of  VT  Is  a singular  V-node.  In  7 , there  is  a sequence  of 
V-trees  Tq(»  VT),  T^,...,Tp,  such  that  T^  is  a subtree  of  T^  ^ (for  i*l,...,p), 
the  roots  of  TQ,...,Tp_^  are  singular  V-nodes,  while  the  root  of  Tp  is  regular, 
and  | tJ  < K.  Clearly  level  (Tp)*l-p,  whence,  by  the  proof  of  case  (1), 

6(Tp)  < 21og|Tp|  + 3(l-p)-l.  Now,  notice  that  T^(l  < j < p)  is  contained  in 
an  (l-j)-string  of  the  formT'TjT",  whence  6(Tj)  < 6(T^+1)+3.  It  follows  that 
6 (VT)  < 6(Tp)  + 3p  < 21og|Tp|  + 3(l-p)-l  + 3p  < 21ogK  + 31-1,  since  |Tp|  < | VT|  = K. 
The  proof  is  thus  completed.  □ 

c 

In  conclusion  we  have: 

Theorem.  The  depth  of  the  binary  tree  J is  less  than  6 T lognl  . 

Proof : If  the  root  of  7 is  a V-node,  then  7 Is  a V-tree  of  level  Tlognl 
and,  by  lemma  2,  depth  (7)  < Tlognl  + log(n-2)  + 3rioghl-l  < 6f"lognl-l.  If 
the  root  of  J is  an  H-node,  then  there  is  one  edge  in  G between  and  vn_^» 
and  (n-1)  is  a power  of  2.  In  this  case  G appears  as  G^tG2  where  both  G^ 
and  are  graphs  with  no  more  than  n vertices;  G^  and  can  be  structured 
into  binary  trees  and  respectively,  whose  roots  are  V-nodes  and  heights 
are  less  than  6flognl-l.  It  follows  that  the  tree  structuring  G has  depth  less 
than  6flognl  . □ 


We  shall  now  estimate  the  running  time  of  the  procedure.  First  we 
consider  the  global  work  performed  by  the  BALANCE  subroutine,  described  in 
the  proof  of  Lemma  2.  If  U contains  r V-trees  Tj^T^ . . . ,Tr,  then,  using  a 
result  of  [10],  the  balancing  runs  in  time  O(rlogr).  As  we  have  shown,  the 
total  number  of  V-trees  involved  in  balancing  operations  is  0(n),  whence 
O(nlogn)  is  the  overall  running  time  of  BALANCE. 
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W6  shall  now  evaluate  the  running  time  of  the  procedure  ORGANIZE.  We 
have  just  shown  that  BALANCE  uses  O(nlogn)  steps  altogether;  analogously 
JOIN  runs  in  time  proportional  to  the  number  of  V-nodes,  i.e.,  in  time  O(nlogn). 

The  remaining  work  is  conveniently  charged  to  the  individual  segments.  Specifi- 
cally, each  segment  is  transferred  from  £ to  S (Steps  5 and  8),  and  then 


from  S to  U (Steps  11  and  12);  clearly,  the  work  expended  in  these  transfers 
is  bounded  by  a constant.  When  a segment  x is  transferred  from  £ to  S we 
associate  with  it  a pointer  b(x)  (Step  8)  to  the  string  L(x)  of  its  upward 
continuations.  The  construction  of  L(x)(Step  7)  takes  time  proportional  to 
its  size,  so  that  the  global  work  which  is  done  in  Step  7 is  proportional 
to  the  number  O(nlogn)  of  segments;  the  construction  of  b(x)  takes  constant 

> 

time.  In  summary,  a segment  x is  transferred  from  an  original  segment  list 
to  some  list  L(t)  of  "upward  continuations"  of  some  other  segment  t and  from 
here  to  a stack  B;  from  B it  is  next  transferred  to  S and  finally  to  U: 
clearly  the  total  work  involved  per  segment  is  bounded  by  a constant,  and 
since  there  are  G(nlogn)  segments,  also  this  portion  of  the  work  is  O(nlogn). 

We  conclude  therefore  that  the  running  time  of  ORGANIZE  (£^,0,0-1)  is  O(nlogn); 
that  the  space  used  is  also  O(nlogn)  is  straightforward. 

5 . Point  location 

To  locate  a point  Pq  * (Xg,yg)  the  planar  subdivision  induced  by  G,  we 
use  7 as  a binary  search  tree.  With  each  H-node  of  J which  has  one  or  no  des- 
cendant we  append  one  or  two  leaves,  respectively,  and  with  each  such  leaf  we 


associate  the  identifier  of  a planar  region  (bordering  with  the  edge  associated 
with  the  parent  H-node).  The  point  location  proceeds  as  follows:  at  each  node 
V of  7 we  choose  a branch:  if  V is  a V-node,  by  comparing  y^  with  y(V);  if  V 
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is  an  H-node,  by  testing  the  sign  of  f(Xg,yg),  where  f(x,y)  is  the  discriminant 
function  of  V.  Thus  we  trace  a unique  path  from  the  root  to  a leaf  at  which 
point  the  point  location  is  completed.  By  the  preceding  discussion  this 
process  uses  a number  of  comparisons  bounded  by  the  depth  of  7,  i.e.,  bTlognl  . 

6.  Comments  and  Applications 

As  the  previous  analysis  indicates,  planar  point  location  is  simply 
done  in  time  O(logn)  using  a search  structure  which  can  be  stored  in 
O(nlogn)  space.  Specifically,  less  than  6flognl  comparisons  are  ever 
needed,  although  the  analysis  which  establishes  the  upper-bound  on  the 
depth  of  X is  overly  generous  and  a rultiplicatJive  constant  for  flognl 
substantially  lower  than  6 can  be  expected. 

As  to  the  storage  requirement,  the  analysis  refers  to  the  case  in 
which  each  of  0(n)  edges  is  partitioned  into  O(logn)  segments;  this 
intuitively  corresponds  to  a large  fraction  of  long  edges,  which 
presumably  is  not  the  average  case;  however,  graphs  can  be  constructed 
for  which  this  situation  occurs.  It  is  conceivable  that  the  simple 
approach  presented  in  this  paper  could  be  further  refined  to  achieve 
0(n)  storage  while  maintaining  O(logn)  search  time. 

Notice  that  the  described  point  location  method  is  not  restricted 
to  triangulations,  nor  to  planar  subdivisions  induced  by  straight-line 
graphs.  Indeed  the  straight-line  segments  may  be  replaced  by  other  curves 
if  the  following  two  properties  hold:  (i)  the  curves  are  single-valued 
in  one  selected  coordinate  (say,  y),and  (ii)  the  discrimination  of  a point 
with  respect  to  any  of  the  curves  can  be  done  in  constant  time.  For  example 
these  conditions  are  clearly  met  by  arcs  of  circle  or  of  other  conics  if 
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they  have  no  horizontal  tangent,  except  possibly  at  their  extremes.  We 
can  now  mention  two  applications  of  the  given  method.  Both  problems  have 
recently  received  consideration  in  the  literature  [11,12]. 

Fixed-radius  near  neighbor  searching.  This  problem  involved  finding 
all  points  of  a set  F in  the  plane  which  are  within  some  fixed  radius  r of  a 
"query  point"[ll],  Bentley  and  Maurer  have  recently  proposed  - among 
other  methods  - a locus  approach,  which  consists  in  subdividing  the  plane 
into  regions  each  of  which  is  the  locus  of  the  points  within  distance  r 
from  a given  subset  F'  of  F (this  region  is  clearly  the  intersection  of 
all  the  circles  with  radius  r centered  at  the  points  in  F’).  Let 
F ■ and  let  be  the  circle  of  radius  r with  center  in 

p^€F.  For  each  C^,  let  u^  and  be  the  two  points  on  the  circle  CL  with 
largest  and  smallest  ordinates,  respectively,  and  let  I denote  the  set  of 
intersections  of  pairs  of  circles  in  (c^J  i m 1, . . . ,n  }.  If  we  define 
V = lUfuJ  i * 1, . . . ,n}u{jiJ  i * 1, . . . ,n} , the  circumference  of  each  (L  is 
partitioned  into  a set  of  arcs  which  have  properties  (i)  and  (ii)  given 
above.  Therefore  V is  the  vertex  set  of  a planar  graph  G whose  edges  are 
the  arcs  just  described.  To  this  planar  graph  the  method  of  this  paper  is 

applicable.  Since  |v|  * |l|  + | {h^| i = 1 , . . . ,n) | + | { X ^ | i * 1, . . . ,n} | = 

2 

2(^)  + n + n = n(n+l),  graph  G is  planar  with  0(n  ) vertices.  Thus  fixed- 

radius  near-neighbor  searching  can  be  solved  in  O(logn)  time  with  a data 

2 2 

structure  using  0(n  logn)  space  and  constructible  in  0(n  logn)  time;  in  [11] 

3 

the  latter  two  quantities  are  both  0(n  ). 

2.  Maxima  testing  in  three  dimensions.  For  points  u and  v in  three- 

3 

dimensional  Euclidean  space  R , u is  said  to  dominate  v if  x^fu]  > x.Jv] 

3 

(i * 1,2,3).  Given  a finite  set  F of  points  in  R , u£F  is  a maximum  of  F 
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if  it  is  not  dominated  by  any  other  point  in  F.  Suppose  now  that  F is  a set 
of  maxima  of  F;  testing  a target  point  p for  maximum  in  F means  to  determine 
if  there  is  at  least  a point  u$F  which  dominates  p. 

Letting  |f|=  n,  Bentley  [12]  solves  this  problem  in  O(log^n)  time 
on  a search  data  structure  that  is  stored  in  O(nlogn)  space  and  is 
constructed  in  O(nlogn)  time.  We  now  show  that  the  same  storage  and 
preprocessing  time  can  be  maintained  while  reducing  the  test  time  to 
O(logn) . 

. n 

Let  F = lu^,...,unJ.  Let  v be  the  point  such  that  x^ [v]  = min  x^[u^] 

(j=l,2,3);  for  convenience  we  may  assume  that  v be  the  origin  of  ® , so  that  al? 

3 

points  of  F lie  in  the  positive  orthant  ]R+.  Let  M.  be  the  domain  of  points 
3 n 

of  H dominated  by  u €F,  and  let  M = U M . . Consider  now  the  surface  of  M 
+ i=l  1 

and  suppose  to  project  it  on  one  of  the  coordinate  planes,  say  (x^.x^).  This 
projection  appears  as  a planar  straight-line  graph  G,  each  finite  region 
of  which  is  the  projection  of  a portion  of  the  surface  of  M^,  for  some  i 
(Figure  5);  it  follows  that  if  the  (x^ .x^j-projection  of  the  target  point 


p falls  In  the  region  of  G associated  with  u^F,  then  the  maxima  testing 

reduces  to  comparing  x^[p]  with  x^[u^].  Thus  maxima  testing  is  done  via 

point-location  in  G.  Notice  now  that  G has  two  edges  - respectively  parallel 

to  the  xx  and  x2  axes  - issuing  from  the  (x^x^-projection  of  each  u^F. 

It  is  easy  to  realize  that  the  point-location  procedure  can  be  applied  to 

the  graph  consisting  of  the  n edges  parallel  to,  say,  the  x^axis,  and  the 

positive  x2-axis  itself  (see  Figure  5).  Obviously  the  search  data  structure 

can  be  stored  in  O(nlogn)  space  and  is  constructible  in  O(nlogn)  time. 

Referring  to  the  arguments  of  Bentley  [12],  the  time  for  worst-case  maxima 

k-1  k-2 

testing  in  k dimensions  can  be  reduced  from  O(log  n)  to  O(log  n)  for 


I 
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